Stabilizing multivariate normal approximation #380

han-ol · 2025-03-27T15:37:32Z

This PR seeks to adress the stability of estimating mean and covariance using

inference_network=bf.networks.PointInferenceNetwork(
        scores=dict(
            mvn=bf.scores.MultivariateNormalScore()
        )
)
# -> results in warning: MultivariateNormalScore is unstable.

It is fixed by using link function called PositiveDefinite, that is based on the Cholesky decomposition.

paul-buerkner · 2025-03-27T15:40:09Z

Thanks! Is it intentional that this is to be merged into the build-deepset-without-call branch instead of into main (or dev)?

han-ol · 2025-03-27T15:49:55Z

Thanks for pointing that out, it was not intentional. It was actually developed against the current dev branch, so I changed it accordingly.

issue with TensorFlow backend: the new bf.links.PositiveDefinite() relies on bf.utils.fill_triangular_matrix(). For some reason the keras.ops.tril call fails for tensorflow backend claiming "pred must not be a Python bool", without it being clear where such a pred variable is defined.
issue with PyTorch backend: tests for positive definiteness are failing.

I will try to reproduce the error outside of BayesFlow.

han-ol · 2025-03-28T16:30:15Z

Pair programming with Valentin was successful (thx!) and we smoothed out the flaky tests. All backends support estimation of multivariate normal parameters now.

I removed some rough edges, added tests, so hopefully the codecov/path tests pass too now.

The PR is ready for review as soon as tests passed.

EDIT: tests passed, ready for review.

vpratz · 2025-03-28T18:43:23Z

bayesflow/scores/scoring_rule.py

        self.subnets_kwargs = subnets_kwargs or {}
        self.links = links or {}

+        self.not_transforming_like_vector = []


Please add a comment documenting what this should contain, so people who want to set this in a subclass know what to use

+1, I would also like to know if we can handle this in a better way than a special attribute.

Do we want this to be variable within a class, i.e., does this have to be an instance variable? If not, a constant class variable might be an option, like

class ScoringRule: #: This variable lists ... (the #: syntax should allow sphinx to parse this) NOT_TRANSFORMING_LIKE_VECTOR = tuple() # use immutable type tuple instead of list

Good idea to use a class variable for this! Implemented it in d87b0b9.

I added documenting comments as well.

Regarding a better way of handling this in general:

There is a discussion in #304 about the long term plans for special estimators and how they play with adapter transformations.
In my opinion, what we have here suffices for now and is a reasonable first step.

LarsKue

Code overall looks good. See individual comments. I don't know enough about the requirements of this task to give a definite approval, but I trust that @han-ol did their research on this 🙂

Please do not merge before the individual comments are resolved, though.

bayesflow/links/positive_definite.py

bayesflow/networks/point_inference_network.py

LarsKue · 2025-03-29T22:28:46Z

bayesflow/approximators/point_approximator.py

    def _prepare_conditions(self, conditions: dict[str, np.ndarray], **kwargs) -> dict[str, Tensor]:
        """Adapts and converts the conditions to tensors."""
        conditions = self.adapter(conditions, strict=False, stage="inference", **kwargs)
+        conditions.pop("inference_variables", None)


We could add this function to the ContinuousApproximator, if it is identical between it and the Point Approximator

Yes! This and similar refactoring of the ContinuousApproximator is a good idea (but I would keep them out of this PR).
There is also the option of moving the conversion to tensor into the adapter. Possibly with an optional bool flag convert_to_tensor that is by default False.

bayesflow/approximators/point_approximator.py

bayesflow/scores/multivariate_normal_score.py

LarsKue · 2025-03-29T22:33:10Z

bayesflow/scores/scoring_rule.py

        self.subnets_kwargs = subnets_kwargs or {}
        self.links = links or {}

+        self.not_transforming_like_vector = []


+1, I would also like to know if we can handle this in a better way than a special attribute.

bayesflow/scores/scoring_rule.py

tests/test_links/test_links.py

han-ol · 2025-04-01T16:41:29Z

Thank you for your helpful reviews!

I resolved the conversations concerning things that are unequivocally solved. What is left open is for you to check if you are happy with the state of things.

From my side, the PR is ready to merge.

Better parameterization of covariance matrices

34c7f2a

han-ol changed the base branch from main to build-deepset-without-call March 27, 2025 15:38

Fix format string

84ed002

han-ol changed the base branch from build-deepset-without-call to dev March 27, 2025 15:45

han-ol closed this Mar 27, 2025

han-ol reopened this Mar 27, 2025

han-ol added 5 commits March 28, 2025 16:45

Test for invertibility of positive definite link output

fbc01f5

Allow estimation of univariate MVN

eebf950

Remove commented lines

42c6806

Minor changes to comments and docstring for fill_triangular_matrix

d57970a

Test coverage for unconditional MVNScore.sample

ddfdbdc

han-ol self-assigned this Mar 28, 2025

Remove instability warning MultivariateNormalScore

2b38c21

vpratz reviewed Mar 28, 2025

View reviewed changes

LarsKue approved these changes Mar 29, 2025

View reviewed changes

han-ol added 12 commits March 31, 2025 16:32

Remove commented numpy import

1405ee5

Fix dtype of dummy conditions if inference variables are available

f1e1ba1

Tuple conversion in case batch_shape is a list

9d87656

Conversion to numpy before calling numpy operations

4bbbffa

More detailed docs and renamed the transformation warning attribute

fe201aa

Doc string detail

02ea22c

Remove untested comment for PointInferenceNetwork.sample()

9b46601

Relax type hints for ContinuousApproximator.log_prob

5cb8995

Support log-prob in PointApproximator

303127d

Remove comment stating log prob was untested

93e8833

Fix typo

7bfacff

Transformation warning using a class variable; docstring links

d87b0b9

han-ol mentioned this pull request Apr 1, 2025

Correct transformation of second-order tensors in adapters #304

Closed

stefanradev93 merged commit f21e6ef into bayesflow-org:dev Apr 2, 2025
15 checks passed

Stabilizing multivariate normal approximation #380

Stabilizing multivariate normal approximation #380

Uh oh!

Conversation

han-ol commented Mar 27, 2025

Uh oh!

paul-buerkner commented Mar 27, 2025

Uh oh!

han-ol commented Mar 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

han-ol commented Mar 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LarsKue left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

han-ol commented Apr 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

han-ol commented Mar 27, 2025 •

edited

Loading

han-ol commented Mar 28, 2025 •

edited

Loading

LarsKue left a comment •

edited

Loading